ggml-cuda : add rope f16, restore performance with parallel decoding #3272
+110 −67